Incremental Connectivity-Based Outlier Factor Algorithm
نویسندگان
چکیده
Outlier detection has recently become an important problem in many industrial and financial applications. Often, outliers have to be detected from data streams that continuously arrive from data sources. Incremental outlier detection algorithms, aimed at detecting outliers as soon as they appear in a database, have recently become emerging research field. In this paper, we develop an incremental version of connectivity-based outlier factor (COF) algorithm and discuss its computational complexity. The proposed incremental COF algorithm has equivalent detection performance as the iterated static COF algorithm (applied after insertion of each data record), with significant reduction in computational time. The paper provides theoretical and experimental evidence that the number of updates per such insertion/deletion does not depend on the total number of points in the data set, which makes algorithm viable for very large dynamic datasets. Finally, we also illustrate an application of the proposed algorithm on motion detection in video surveillance applications.
منابع مشابه
Small Moving Targets Detection Using Outlier Detection Algorithms
Recent research in motion detection has shown that various outlier detection methods could be used for efficient detection of small moving targets. These algorithms detect moving objects as outliers in a properly defined attribute space, where outlier is defined as an object distinct from the objects in its neighborhood. In this paper, we compare the performance of two incremental outlier detec...
متن کاملComparative Study of Incremental Learning Algorithms in Multidimensional Outlier Detection on Data Stream
Multi-dimensional outlier detection (MOD) over data streams is one of the most significant data stream mining techniques. When multivariate data are streaming in high speed, outliers are to be detected efficiently and accurately. Conventional outlier detection method is based on observing the full dataset and its statistical distribution. The data is assumed stationary. However, this convention...
متن کاملRough K-means Outlier Factor Based on Entropy Computation
Many studies of outlier detection have been developed based on the cluster-based outlier detection approach, since it does not need any prior knowledge of the dataset. However, the previous studies only regard the outlier factor computation with respect to a single point or a small cluster, which reflects its deviates from a common cluster. Furthermore, all objects within outlier cluster are as...
متن کاملAdaptive Methods for Activity Monitoring of Streaming Data
Activity monitoring deals with monitoring data (usually streaming data) for interesting events. It has several applications such as building an alarm or an alert system that triggers when outliers or change points are detected. We discuss desiderata for such a system. Then, assuming that the data can be modeled by linear models, we describe an adaptive incremental method for detecting outliers ...
متن کاملAnomaly Detection over Concept Drifting Data Streams
Outlier detection over data streams has attracted attention for many emerging applications, such as network intrusion detection, web click stream and aircraft health anomaly detection. Since the data stream is likely to change over time, it is important to be able to modify the outlier detection model appropriately with the evolution of the stream. Most existing approaches were using incrementa...
متن کامل